NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Planet Microbe: a platform for marine microbiology to discover and analyze interconnected ‘omics and environmental data

https://doi.org/10.1093/nar/gkaa637

Ponsero, Alise J; Bomhoff, Matthew; Blumberg, Kai; Youens-Clark, Ken; Herz, Nina M; Wood-Charlson, Elisha M; Delong, Edward F; Hurwitz, Bonnie L (July 2020, Nucleic Acids Research)
null (Ed.)
Abstract In recent years, large-scale oceanic sequencing efforts have provided a deeper understanding of marine microbial communities and their dynamics. These research endeavors require the acquisition of complex and varied datasets through large, interdisciplinary and collaborative efforts. However, no unifying framework currently exists for the marine science community to integrate sequencing data with physical, geological, and geochemical datasets. Planet Microbe is a web-based platform that enables data discovery from curated historical and on-going oceanographic sequencing efforts. In Planet Microbe, each ‘omics sample is linked with other biological and physiochemical measurements collected for the same water samples or during the same sample collection event, to provide a broader environmental context. This work highlights the need for curated aggregation efforts that can enable new insights into high-quality metagenomic datasets. Planet Microbe is freely accessible from https://www.planetmicrobe.org/.
more » « less
Full Text Available
Libra: scalable k-mer based tool for massive all-vs-all metagenome comparisons

https://doi.org/10.1093/gigascience/giy165

Choi, Illyoung; Ponsero, Alise J; Bomhoff, Matthew; Youens-Clark, Ken; Hartman, John H; Hurwitz, Bonnie L (December 2018, GigaScience)

Full Text Available
Libra: Improved Partitioning Strategies for Massive Comparative Metagenomics Analysis

https://doi.org/10.1145/3217880.3217882

Choi, Illyoung; Ponsero, Alise J.; Youens-Clark, Ken; Bomhoff, Matthew; Hurwitz, Bonnie L.; Hartman, John H. (June 2018, Proceedings of the 9th Workshop on Scientific Cloud Computing)

Big-data analytics platforms, such as Hadoop, are appealing for scientific computation because they are ubiquitous, well-supported, and well-understood. Unfortunately, load-balancing is a common challenge of implementing large-scale scientific computing applications on these platforms. In this paper we present the design and implementation of Libra, a Hadoop-based tool for comparative metagenomics (comparing samples of genetic material collected from the environment). We describe the computation that Libra performs and how that computation is implemented using Hadoop tasks, including the techniques used by Libra to ensure that the task workloads are balanced despite nonuniform sample sizes and skewed distributions of genetic material in the samples. On a 10-machine Hadoop cluster Libra can analyze the entire Tara Ocean Viromes of ~4.2 billion reads in fewer than 20 hours.
more » « less
Full Text Available

Search for: All records